Conversation
maleadt
approved these changes
Apr 1, 2026
Member
maleadt
left a comment
There was a problem hiding this comment.
Doesn't it need an additional empty line too?
Contributor
There was a problem hiding this comment.
CUDA.jl Benchmarks
Details
| Benchmark suite | Current: 83bb0bb | Previous: e4ac81a | Ratio |
|---|---|---|---|
array/accumulate/Float32/1d |
100802 ns |
101073 ns |
1.00 |
array/accumulate/Float32/dims=1 |
76335 ns |
76196 ns |
1.00 |
array/accumulate/Float32/dims=1L |
1591571 ns |
1585166 ns |
1.00 |
array/accumulate/Float32/dims=2 |
143414.5 ns |
143846 ns |
1.00 |
array/accumulate/Float32/dims=2L |
659728 ns |
657343 ns |
1.00 |
array/accumulate/Int64/1d |
118579 ns |
118428 ns |
1.00 |
array/accumulate/Int64/dims=1 |
79662 ns |
79813 ns |
1.00 |
array/accumulate/Int64/dims=1L |
1706539 ns |
1706332.5 ns |
1.00 |
array/accumulate/Int64/dims=2 |
156463 ns |
155958.5 ns |
1.00 |
array/accumulate/Int64/dims=2L |
961613 ns |
961689 ns |
1.00 |
array/broadcast |
20483 ns |
20223 ns |
1.01 |
array/construct |
1256.8 ns |
1268 ns |
0.99 |
array/copy |
18309 ns |
18010.5 ns |
1.02 |
array/copyto!/cpu_to_gpu |
213802 ns |
214386 ns |
1.00 |
array/copyto!/gpu_to_cpu |
280741 ns |
282599 ns |
0.99 |
array/copyto!/gpu_to_gpu |
10796 ns |
10725 ns |
1.01 |
array/iteration/findall/bool |
134912 ns |
133957 ns |
1.01 |
array/iteration/findall/int |
149881 ns |
148817 ns |
1.01 |
array/iteration/findfirst/bool |
81424 ns |
80695 ns |
1.01 |
array/iteration/findfirst/int |
83047 ns |
82681 ns |
1.00 |
array/iteration/findmin/1d |
85620 ns |
85081 ns |
1.01 |
array/iteration/findmin/2d |
116755.5 ns |
116308 ns |
1.00 |
array/iteration/logical |
200519.5 ns |
196867 ns |
1.02 |
array/iteration/scalar |
67232 ns |
66869 ns |
1.01 |
array/permutedims/2d |
52311 ns |
51940.5 ns |
1.01 |
array/permutedims/3d |
52604 ns |
52252 ns |
1.01 |
array/permutedims/4d |
52304 ns |
51176 ns |
1.02 |
array/random/rand/Float32 |
12898 ns |
13404 ns |
0.96 |
array/random/rand/Int64 |
24831 ns |
24711 ns |
1.00 |
array/random/rand!/Float32 |
8865.666666666666 ns |
10187 ns |
0.87 |
array/random/rand!/Int64 |
21487 ns |
21588 ns |
1.00 |
array/random/randn/Float32 |
40965 ns |
43197 ns |
0.95 |
array/random/randn!/Float32 |
30680 ns |
30898 ns |
0.99 |
array/reductions/mapreduce/Float32/1d |
34511.5 ns |
34650 ns |
1.00 |
array/reductions/mapreduce/Float32/dims=1 |
39289.5 ns |
39535.5 ns |
0.99 |
array/reductions/mapreduce/Float32/dims=1L |
51159 ns |
51174 ns |
1.00 |
array/reductions/mapreduce/Float32/dims=2 |
56237 ns |
56317 ns |
1.00 |
array/reductions/mapreduce/Float32/dims=2L |
69047.5 ns |
69320 ns |
1.00 |
array/reductions/mapreduce/Int64/1d |
42153 ns |
42198 ns |
1.00 |
array/reductions/mapreduce/Int64/dims=1 |
41676 ns |
41733 ns |
1.00 |
array/reductions/mapreduce/Int64/dims=1L |
86837 ns |
86905 ns |
1.00 |
array/reductions/mapreduce/Int64/dims=2 |
59261.5 ns |
59221 ns |
1.00 |
array/reductions/mapreduce/Int64/dims=2L |
84209 ns |
84434 ns |
1.00 |
array/reductions/reduce/Float32/1d |
34888.5 ns |
34198.5 ns |
1.02 |
array/reductions/reduce/Float32/dims=1 |
39923.5 ns |
39152.5 ns |
1.02 |
array/reductions/reduce/Float32/dims=1L |
51228 ns |
51208.5 ns |
1.00 |
array/reductions/reduce/Float32/dims=2 |
56578 ns |
56419 ns |
1.00 |
array/reductions/reduce/Float32/dims=2L |
69443.5 ns |
69565 ns |
1.00 |
array/reductions/reduce/Int64/1d |
42143 ns |
42368 ns |
0.99 |
array/reductions/reduce/Int64/dims=1 |
43281 ns |
50017 ns |
0.87 |
array/reductions/reduce/Int64/dims=1L |
86851 ns |
86919 ns |
1.00 |
array/reductions/reduce/Int64/dims=2 |
59536 ns |
59687 ns |
1.00 |
array/reductions/reduce/Int64/dims=2L |
84145 ns |
84484 ns |
1.00 |
array/reverse/1d |
17588 ns |
17716 ns |
0.99 |
array/reverse/1dL |
68190 ns |
68268 ns |
1.00 |
array/reverse/1dL_inplace |
65897 ns |
65642 ns |
1.00 |
array/reverse/1d_inplace |
8678 ns |
10197.333333333334 ns |
0.85 |
array/reverse/2d |
20759 ns |
20523 ns |
1.01 |
array/reverse/2dL |
72864 ns |
72523 ns |
1.00 |
array/reverse/2dL_inplace |
65669 ns |
65706 ns |
1.00 |
array/reverse/2d_inplace |
9871 ns |
9831 ns |
1.00 |
array/sorting/1d |
2734974.5 ns |
2735407.5 ns |
1.00 |
array/sorting/2d |
1068342 ns |
1068528 ns |
1.00 |
array/sorting/by |
3304444 ns |
3304139 ns |
1.00 |
cuda/synchronization/context/auto |
1168.2 ns |
1165.7 ns |
1.00 |
cuda/synchronization/context/blocking |
889.96875 ns |
876.8679245283018 ns |
1.01 |
cuda/synchronization/context/nonblocking |
6929 ns |
6853.1 ns |
1.01 |
cuda/synchronization/stream/auto |
1036.0666666666666 ns |
1035.7142857142858 ns |
1.00 |
cuda/synchronization/stream/blocking |
798.0410958904109 ns |
789.9066666666666 ns |
1.01 |
cuda/synchronization/stream/nonblocking |
7090.9 ns |
7513.5 ns |
0.94 |
integration/byval/reference |
143718 ns |
143670 ns |
1.00 |
integration/byval/slices=1 |
145698 ns |
145632 ns |
1.00 |
integration/byval/slices=2 |
284371 ns |
284397 ns |
1.00 |
integration/byval/slices=3 |
422954.5 ns |
423006 ns |
1.00 |
integration/cudadevrt |
102328 ns |
102385 ns |
1.00 |
integration/volumerhs |
23416584 ns |
23431934.5 ns |
1.00 |
kernel/indexing |
13074 ns |
13020 ns |
1.00 |
kernel/indexing_checked |
13847 ns |
13828 ns |
1.00 |
kernel/launch |
2238.4444444444443 ns |
2128.1111111111113 ns |
1.05 |
kernel/occupancy |
881.2278481012659 ns |
701.7571428571429 ns |
1.26 |
kernel/rand |
14124 ns |
14086 ns |
1.00 |
latency/import |
3816521940 ns |
3826333658 ns |
1.00 |
latency/precompile |
4580398217.5 ns |
4608074622.5 ns |
0.99 |
latency/ttfp |
4397684562.5 ns |
4404025901.5 ns |
1.00 |
This comment was automatically generated by workflow using github-action-benchmark.
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## master #3074 +/- ##
==========================================
- Coverage 16.59% 16.58% -0.02%
==========================================
Files 120 120
Lines 9586 9586
==========================================
- Hits 1591 1590 -1
- Misses 7995 7996 +1 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
No description provided.